Text2LIVE: Text-Driven Layered Image and Video Editing
نویسندگان
چکیده
We present a method for zero-shot, text-driven editing of natural images and videos. Given an image or video text prompt, our goal is to edit the appearance existing objects (e.g., texture) augment scene with visual effects smoke, fire) in semantic manner. train generator on internal dataset, extracted from single input, while leveraging external pretrained CLIP model impose losses. Rather than directly generating edited output, key idea generate layer (color+opacity) that composited over input. This allows us control generation maintain high fidelity input via novel losses applied layer. Our neither relies nor requires user-provided masks. demonstrate localized, edits high-resolution videos across variety scenes. Webpage: http://www.text2live.github.io .
منابع مشابه
Text-image Coupling for Editing Literary Sources
Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and browsing of complex literary hypermedia including original manuscript documents and other handwritten sources. Editing capabilities allow the user to transcribe manuscript images in an interactive way and to encode the re...
متن کاملEditing out Video Editing
paradigm shift in media production: the advent of computational media production that will automate the capture, editing, and reuse of video content. By integrating metadata creation and (re)use throughout the media production process, we’ll enable the mass customization of video. F or the majority of people to not just watch but make video on a daily basis, the current media production process...
متن کاملBayesian Scheme for Interactive Colourization, Recolourization and Image/Video Editing
We propose a general image and video editing method based on a Bayesian segmentation framework. In the first stage, classes are established from scribbles made by a user on the image. These scribbles can be considered as a multimap (multilabel map) that defines the boundary conditions of a probability measure field to be computed for each pixel. In the second stage, the global minima of a posit...
متن کاملDesign Issues for Line-Driven Text Editing / Annotation Systems
Recent research on interfaces driven by line-markings indicates that there are many potential benefits and applications of such interfaces. Benefits include the exploitation of users' handwriting skills and their skills in understanding handwritten marks. There are systems that have exploited one or the other of these benefits but not both. One application which would take advantage of both of ...
متن کاملAdvanced editing methods for image and video sequences
In the context of image and video editing, this thesis proposes methods for modifying the semantic content of a recorded scene. Two different editing problems are approached: First, the removal of ghosting artifacts from high dynamic range (HDR) images recovered from exposure sequences, and second, the removal of objects from video sequences recorded with and without camera motion. These editin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19784-0_41